Search CORE

62 research outputs found

Benchmarking of protein descriptor sets in proteochemometric modeling (part 2): modeling performance of 13 amino acid descriptor sets.

Author: Bender A.
Cortes-Ciriano I.
IJzerman A.P.
Overington J.P.
Swier R.F.
Vlijmen H. van
Wegner J.K.
Westen G.J.P. van
Publication venue
Publication date: 01/01/2013
Field of study

Background While a large body of work exists on comparing and benchmarking descriptors of molecular structures, a similar comparison of protein descriptor sets is lacking. Hence, in the current work a total of 13 amino acid descriptor sets have been benchmarked with respect to their ability of establishing bioactivity models. The descriptor sets included in the study are Z-scales (3 variants), VHSE, T-scales, ST-scales, MS-WHIM, FASGAI, BLOSUM, a novel protein descriptor set (termed ProtFP (4 variants)), and in addition we created and benchmarked three pairs of descriptor combinations. Prediction performance was evaluated in seven structure-activity benchmarks which comprise Angiotensin Converting Enzyme (ACE) dipeptidic inhibitor data, and three proteochemometric data sets, namely (1) GPCR ligands modeled against a GPCR panel, (2) enzyme inhibitors (NNRTIs) with associated bioactivities against a set of HIV enzyme mutants, and (3) enzyme inhibitors (PIs) with associated bioactivities on a large set of HIV enzyme mutants. Results The amino acid descriptor sets compared here show similar performance ( 0.3 log units RMSE difference and >0.7 difference in MCC). Combining different descriptor sets generally leads to better modeling performance than utilizing individual sets. The best performers were Z-scales (3) combined with ProtFP (Feature), or Z-Scales (3) combined with an average Z-Scale value for each target, while ProtFP (PCA8), ST-Scales, and ProtFP (Feature) rank last. Conclusions While amino acid descriptor sets capture different aspects of amino acids their ability to be used for bioactivity modeling is still – on average – surprisingly similar. Still, combining sets describing complementary information consistently leads to small but consistent improvement in modeling performance (average MCC 0.01 better, average RMSE 0.01 log units lower). Finally, performance differences exist between the targets compared thereby underlining that choosing an appropriate descriptor set is of fundamental for bioactivity modeling, both from the ligand- as well as the protein side

Leiden University Scholary Publications

Applications of proteochemometrics - from species extrapolation to cell line sensitivity modelling

Author: Bender A.
Cortes-Ciriano I.
Lenselink E.B.
Malliavin T.E.
Murrell D.S.
Westen G.J.P. van
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2015
Field of study

Medicinal Chemistr

Springer - Publisher Connector

PubMed Central

Leiden University Scholary Publications

Perspective on oncogenic processes at the end of the beginning of cancer genomics

Author: Akbani R
Bailey MH
Bertrand D
Chen F
Cherniack AD
Colaprico A
Cortes-Ciriano I
Ding L
Getz G
Gibbs DL
Hoadley KA
Huang K-L
Hutter CM
Jayasinghe R
Kim J
Lazar AJ
Mills GB
Nagarajan N
Olsen C
Porta-Pardo E
Shmulevich L
Stuart JM
Sun S
Suphavilai C
Taylor AM
Thorsson V
Tokheim C
Vincent BG
Weerasinghe A
Wendl MC
Wheeler DA
Wyczalkowski MA
Yu L
Zenklusen JC
Publication venue: 'Elsevier BV'
Publication date: 13/03/2018
Field of study

The Cancer Genome Atlas (TCGA) has catalyzed systematic characterization of diverse genomic alterations underlying human cancers. At this historic junction marking the completion of genomic characterization of over 11,000 tumors from 33 cancer types, we present our current understanding of the molecular processes governing oncogenesis. We illustrate our insights into cancer through synthesis of the findings of the TCGA PanCancer Atlas project on three facets of oncogenesis: (1) somatic driver mutations, germline pathogenic variants, and their interactions in the tumor; (2) the influence of the tumor genome and epigenome on transcriptome and proteome; and (3) the relationship between tumor and the microenvironment, including implications for drugs targeting driver events and immunotherapies. These results will anchor future characterization of rare and common tumor types, primary and relapsed tumors, and cancers across ancestry groups and will guide the deployment of clinical genomic sequencing

Spiral - Imperial College Digital Repository

Intersection of diverse neuronal genomes and neuropsychiatric disease: The Brain Somatic Mosaicism Network

Author: Abyzov A
Akbarian S
Bae T
Barton Ar
Bekiranov S
Bohrson Cl
Burbulis Ie
Chess A
Chronister W
Coppola G
Cortes-Ciriano I
Courchesne E
D'Gama Am
Daily K
Emery Sb
Erwin Ja
Fasching L
Flasch Da
Freed D
Frisbie Tj
Gage Fh
Ganz J
Gao T
Gleeson Jg
Gulyás-Kovács A
Haakenson M
Jaffe Ae
Keil Jm
Kidd Jm
Kopera Hc
Kwan Ky
Kwon M
Lam Mm
Lee Ea
Lehner T
Lodato Ma
Marques-Bonet T
Mathern Gw
Mcconnell Mj
Mills Re
Moldovan Jb
Moran Jv
Oetjens Mt
Omberg L
Paquola Acm
Park Pj
Peters Ma
Pevsner J
Pochareddy S
Pramparo T
Ratan A
Rodin Re
Rosenbluh C
Sanavia T
Senthil G
Sestan N
Sherman Ma
Shi L
Shin Jh
Skarica M
Song S
Straub Re
Thorpe J
Urban Ae
Vaccarino Fm
Walsh Ca
Wang J
Wang M
Wang Y
Weinberger Dr
Wierman M
Wolpert M
Woodworth M
Zhao X
Zhou B
Zhou W
Publication venue: 'American Association for the Advancement of Science (AAAS)'
Publication date: 01/01/2017
Field of study

Institutional Research Information System University of Turin

Machine learning and data mining frameworks for predicting drug response in cancer:An overview and a novel <i>in silico</i> screening process based on association rule mining

Author: Aas
Abrams
Agrawal
Alexander Polyzos
Alexandrov
Ali
Aliper
Ammad-ud-din
Andersson
Andreas Ntargaras
Antoniou
Aristotelis Tsirigos
Athanassios Kotsinas
Azuaje
Baldari
Barretina
Bartkova
Beesley
Bengio
Bertacchini
Bishop
Blachly
Blumenschein
Breiman
Breiman
Brookshear
Byers
Byron
Campbell
Canela
Caponigro
Carracedo
Chang
Chen
Chen
Chiu
Cortes
Corte´s-Ciriano I., van Westen, G.J., Bouvier, G., et al.
Costello
Coudray
Crespi
Creswell
Cuadrado
Daemen
Das
Das Thakur
Day
Dev
Dhillon
Di Micco
Dietterich
Dimitris Thanos
Eisfeld
Enslen
Evangelou
Falgreen
Fang
Fey
Filippos Koinis
Forbes
Friedman
Frismantas
Galanos
Galanos
Garnett
Geeleher
George-Romanos P. Foukas
Gillet
Gorgoulis
Guinney
Gupta
Haar
Haeuw
Halazonetis
Hanahan
Hanahan
Hastie
Henderson
Hills
Hinton
Hinton
Hinton
Hoadley
Holland
Hua Zhou
Hui
Hussmann
Iannis Aifantis
Iorio
James
Jang
Jin
Jiri Bartek
Kanda
Karakaidos
Kastenhuber
Kelland
Kiaris
Kim
Kim
Kleppmann
Knudson
Koinis
Komseli
Konstantinos Vougas
Kragelj
Lacombe
Laplante
LeCun
Lee
Leonidas Alexopoulos
Li
Li
Liang
Libbrecht
Liontos
Lior
Liu
Liu
Logue
Long
Lovitt
Lu
Lunn
Luo
Maron
Masica
McCain
McCulloch
Mehta
Mendelsohn
Menden
Menden
Meng
Milligan
Min
Mirman
Moghaddas Gholami
Muller
Murase
Negrini
Nelder
Neto
Nicolau
Nidheesh
Niepel
Noll
Noordermeer
Núñez-Enríquez
O'Connor
Padovano
Palmirotta
Park
Paul A. Townsend
Pearson
Pemovska
Pereira
Perez
Petrakis
Petros Sfikakis
Planchard
Popovics
Porter
Pritchard
Pritchard
Rampášek
Rangel
Rebecca Fitzgerald
Rickardson
Rodriguez-Escudero
Roidl
Ross
Ruder
Rusnak
Russell Petty
Sahai
Sami
Santana-Codina
Schmidhuber
Schreuer
Seashore-Ludlow
Sethi
Shoemaker
Sideridou
Siolas
Sonali Narang
Steckel
Stone
Stransky
Su
Sueoka
Sun
Taghanaki
Talwar
Tan
Tan
Tentler
Theodore Sakellaropoulos
Tominaga
Tran
Triantaphyllou
Trilla-Fuertes
Turajlic
Turki
Tyner
Ulivi
van de Schoot
van der Maaten
van't Veer
Varmus
Vassilios Myrianthopoulos
Vassilis G. Gorgoulis
Vassilis Georgoulias
Wang
Wang
Wang
Wang
Wang
Weinstein
Weiss
Wu
Wu
Xu
Xu
Yamada
Yan
Yang
Yang
Yeh
Zhang
Zhang
Zhang
Zhao
Zhao
Zheng
Zhong
Zhong
Publication venue: 'Elsevier BV'
Publication date: 01/11/2019
Field of study

Crossref

The University of Manchester - Institutional Repository

University of Dundee Online Publications

Community assessment to advance computational prediction of cancer drug combinations in a pharmacogenomic screen

Author: Abante J
Abecassis BS
Aben N
Aghamirzaie D
Ahsen ME
Aittokallio T
Akhtari FS
Al-lazikani B
Alam T
Allam A
Allen C
Altarawy D
Alves V
Amadoz A
Anchang B
Angel Pujana M
Antolin AA
Ash JR
Ba-alawi W
Bagheri M
Bajic V
Ball G
Ballester PJ
Baptista D
Bare C
Bateson M
Bender A
Bertrand D
Boroevich KA
Bosdriesz E
Bougouffa S
Bounova G
Brouwer T
Bryant B
Bulusu KC
Calaza M
Calderone A
Calza S
Capuzzi S
Carbonell-Caballero J
Carlin D
Carter H
Castagnoli L
Celebi R
Cesareni G
Chang H
Chen G
Chen H
Chen H
Cheng L
Chernomoretz A
Chicco D
Cho K-H
Cho S
Choi D
Choi J
Choi K
Choi M
Coker E
Combinatio A-SD
Cortes-Ciriano I
Cserzo M
Cubuk C
Curtis C
Dang CC
de Almeida MP
De Cock M
de Esch I
de Graaf C
De Maeyer D
De Niz C
de Ruiter JR
De Troyer E
Di Veroli GY
Dijkstra T
Dopazo J
Draghici S
Drosou A
Dry JR
Dumontier M
Ehrhart F
Eid F-E
ElHefnawi M
Elmarakeby H
Engin HB
Evelo C
Falcao AO
Farag S
Fawell S
Fernandez-Lozano C
Fisch K
Flobak A
Fornari C
Foroushani ABK
Fotso DC
Fourches D
Friend S
Frigessi A
Gao F
Gao X
Garnett MJ
Gerold JM
Gestraud P
Ghazoui Z
Ghosh S
Gillberg J
Godoy-Lorite A
Godynyuk L
Godzik A
Goldenberg A
Gomez-Cabrero D
Gonen M
Gray H
Grechkin M
Guan Y
Guimera R
Guinney J
Guney E
Haibe-Kains B
Han Y
Hase T
He D
He L
Heath LS
Hellton KH
Helmer-Citterich M
Hidalgo MR
Hidru D
Hill SM
Hochreiter S
Hong S
Hovig E
Hsueh Y-C
Hu Z
Huang JK
Huang RS
Hunyady L
Hwang J
Hwang TH
Hwang W
Hwang Y
Isayev O
Jack J
Jahandideh S
Jang IS
Jeon M
Ji J
Jo Y
Kamola PJ
Kanev GK
Kang J
Karacosta L
Karimi M
Kaski S
Kazanov M
Khamis AM
Khan SA
Kiani NA
Kim A
Kim J
Kim J
Kim K
Kim K
Kim S
Kim Y
Kim Y
Kirk PDW
Kitano H
Klambauer G
Knowles D
Ko M
Kohn-Luque A
Kooistra AJ
Kuenemann MA
Kuiper M
Kurz C
Kwon M
Laegreid A
Lederer S
Lee H
Lee J
Lee YW
Leppaho E
Lewis R
Li J
Li L
Liley J
Lim WK
Lin C
Liu Y
Lopez Y
Low J
Lysenko A
Machado D
Madhukar N
Malpartida AB
Mamitsuka H
Marabita F
Marchal K
Marttinen P
Mason D
Mason MJ
Mazaheri A
Mehmood A
Mehreen A
Menden MP
Michaut M
Miller RA
Mitsopoulos C
Modos D
Moo K
Motsinger-Reif A
Movva R
Muraru S
Muratov E
Mushthofa M
Nagarajan N
Nakken S
Nath A
Neto EC
Neuvial P
Newton R
Nguyen T
Ning Z
Norman T
Oliva B
Olsen C
Palmeri A
Panesar B
Papadopoulos S
Park J
Park S
Park S
Pawitan Y
Peluso D
Pendyala S
Peng J
Perfetto L
Pirro S
Plevritis S
Politi R
Poon H
Porta E
Prellner I
Preuer K
Ramnarine R
Reid JE
Reyal F
Richardson S
Ricketts C
Rieswijk L
Rocha M
Rodriguez-Gonzalvez C
Roell K
Romeo Aznar V
Rotroff D
Rukawa P
Sadacca B
Saez-Rodriguez J
Safikhani Z
Safitri F
Sales-Pardo M
Sauer S
Schlichting M
Seoane JA
Serra J
Shang M-M
Sharma A
Sharma H
Shen Y
Shiga M
Shin M
Shkedy Z
Shopsowitz K
Sinai S
Skola D
Smirnov P
Soerensen IF
Soerensen P
Song J-H
Song SO
Soufan O
Spitzmueller A
Steipe B
Stolovitzky G
Suphavilai C
Szalai B
Tamayo SP
Tamborero D
Tang EKY
Tang J
Tanoli Z-U-R
Tarres-Deulofeu M
Tegner J
Thommesen L
Tonekaboni SAM
Tran H
Truong A
Tsunoda T
Turu G
Tzeng G-Y
Van Daele D
van Engelen B
van Laarhoven T
Van Moerbeke M
van Westen GJP
Verbeke L
Videla S
Vis D
Vogel R
Voronkov A
Votis K
Walk OBD
Wang A
Wang D
Wang H-QH
Wang P-W
Wang S
Wang W
Wang X
Wang X
Wennerberg K
Wernisch L
Wessels L
Westerman BA
White SR
Wijayawardena B
Willighagen E
Wolfinger R
Wurdinger T
Xie L
Xie S
Xu H
Yadav B
Yau C
Yeerna H
Yin JW
Yu M
Yu M
Yu T
Yun SJ
Zakharov A
Zamichos A
Zanin M
Zaslavskiy M
Zeng L
Zenil H
Zhang F
Zhang P
Zhang W
Zhao H
Zhao L
Zheng W
Zoufir A
Zucknick M
Publication venue
Publication date: 01/01/2019
Field of study

The effectiveness of most cancer targeted therapies is short-lived. Tumors often develop resistance that might be overcome with drug combinations. However, the number of possible combinations is vast, necessitating data-driven approaches to find optimal patient-specific treatments. Here we report AstraZeneca’s large drug combination dataset, consisting of 11,576 experiments from 910 combinations across 85 molecularly characterized cancer cell lines, and results of a DREAM Challenge to evaluate computational strategies for predicting synergistic drug pairs and biomarkers. 160 teams participated to provide a comprehensive methodological development and benchmarking. Winning methods incorporate prior knowledge of drug-target interactions. Synergy is predicted with an accuracy matching biological replicates for >60% of combinations. However, 20% of drug combinations are poorly predicted by all methods. Genomic rationale for synergy predictions are identified, including ADAM17 inhibitor antagonism when combined with PIK3CB/D inhibition contrasting to synergy when combined with other PI3K-pathway inhibitors in PIK3CA mutant cells.Peer reviewe

VU Research Portal

Publikationsserver der Universität Tübingen

Archivio istituzionale della ricerca - Università di Brescia

Leiden University Scholary Publications

UPF Digital Repository

NORA - Norwegian Open Research Archives

White Rose Research Online

CONICET Digital

Ghent University Academic Bibliography

Aaltodoc Publication Archive

Oxford University Research Archive

Repository of the Academy's Library

Apollo (Cambridge)

Diposit Digital de la Universitat de Barcelona

ScholarBank@NUS

Universidade do Minho: RepositoriUM

Ege University Institutional Repository

DI-fusion

Spiral - Imperial College Digital Repository

Helsingin yliopiston digitaalinen arkisto

Queen Mary Research Online

ART

Lirias

Maastricht University Research Portal

University of the South Pacific Electronic Research Repository

eScholarship - University of California

Semmelweis Repository

Archivio della ricerca- Università di Roma La Sapienza

A decision-theoretic approach to the evaluation of machine learning algorithms in computational drug discovery

Author: Cortes-Ciriano I
Taylor A
Watson J
Watson O
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2019
Field of study

Motivation: Artificial intelligence, trained via machine learning (e.g. neural nets, random forests) or computational statistical algorithms (e.g. support vector machines, ridge regression), holds much promise for the improvement of small-molecule drug discovery. However, small-molecule structure-activity data are high dimensional with low signal-to-noise ratios and proper validation of predictive methods is difficult. It is poorly understood which, if any, of the currently available machine learning algorithms will best predict new candidate drugs. Results: The quantile-activity bootstrap is proposed as a new model validation framework using quantile splits on the activity distribution function to construct training and testing sets. In addition, we propose two novel rank-based loss functions which penalize only the out-of-sample predicted ranks of high-activity molecules. The combination of these methods was used to assess the performance of neural nets, random forests, support vector machines (regression) and ridge regression applied to 25 diverse high-quality structure-activity datasets publicly available on ChEMBL. Model validation based on random partitioning of available data favours models that overfit and ‘memorize’ the training set, namely random forests and deep neural nets. Partitioning based on quantiles of the activity distribution correctly penalizes extrapolation of models onto structurally different molecules outside of the training data. Simpler, traditional statistical methods such as ridge regression can outperform state-of-the-art machine learning methods in this setting. In addition, our new rank-based loss functions give considerably different results from mean squared error highlighting the necessity to define model optimality with respect to the decision task at hand.</br

Oxford University Research Archive

A decision-theoretic approach to the evaluation of machine learning algorithms in computational drug discovery

Author: Cortes-Ciriano I
Taylor AR
Watson JA
Watson OP
Publication venue: 'Oxford University Press (OUP)'
Publication date: 09/05/2019
Field of study

Oxford University Research Archive

Temperature Accelerated Molecular Dynamics with Soft-Ratcheting Criterion Orients Enhanced Sampling by Low-Resolution Information

Author: Bouvier G.
Cortes-Ciriano I.
Malliavin T. E.
Maragliano L.
Nilges M.
Publication venue: 'American Chemical Society (ACS)'
Publication date: 01/01/2015
Field of study

Many proteins exhibit an equilibrium between multiple conformations, some of them being characterized only by low-resolution information. Visiting all conformations is a demanding task for computational techniques performing enhanced but unfocused exploration of collective variable (CV) space. Otherwise, pulling a structure toward a target condition biases the exploration in a way difficult to assess. To address this problem, we introduce here the soft-ratcheting temperature-accelerated molecular dynamics (sr-TAMD), where the exploration of CV space by TAMD is coupled to a soft-ratcheting algorithm that filters the evolving CV values according to a predefined criterion. Any low resolution or even qualitative information can be used to orient the exploration. We validate this technique by exploring the conformational space of the inactive state of the catalytic domain of the adenyl cyclase AC from Bordetella pertussis. The domain AC gets activated by association with calmodulin (CaM), and the available crystal structure shows that in the complex the protein has an elongated shape. High-resolution data are not available for the inactive, CaM-free protein state, but hydrodynamic measurements have shown that the inactive AC displays a more globular conformation. Here, using as CVs several geometric centers, we use sr-TAMD to enhance CV space sampling while filtering for CV values that correspond to centers moving close to each other, and we thus rapidly visit regions of conformational space that correspond to globular structures. The set of conformations sampled using sr-TAMD provides the most extensive description of the inactive state of AC up to now, consistent with available experimental information

IRIS UniversitÃ Politecnica delle Marche

A Novel Approach: Nanopore Sequencing of Native Cell-Free DNA in Diffuse-Large B-Cell Lymphoma Patients

Author: Cortes-Ciriano I
Erblich T
Ficz G
Gribben JG
Muyas F
Sauer C
Publication venue
Publication date: 01/11/2023
Field of study

Queen Mary Research Online